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Abstract. For many random Constraint Satisfaction Problems, by now, we have asymptotically tight estimates of 
the largest constraint density for which they have solutions. At the same time, all known polynomial- time algorithms 
for many of these problems already completely fail to find solutions at much smaller densities. For example, it is 
well-known that it is easy to color a random graph using twice as many colors as its chromatic number. Indeed, some 
of the simplest possible coloring algorithms already achieve this goal. Given the simplicity of those algorithms, one 
would expect there is a lot of room for improvement. Yet, to date, no algorithm is known that uses (2 — e)x colors, 
in spite of efforts by numerous researchers over the years. 

In view of the remarkable resilience of this factor of 2 against every algorithm hurled at it, we believe it is natural to 
inquire into its origin. We do so by analyzing the evolution of the set of fc-colorings of a random graph, viewed as a 
subset of {1, . . . , k} n , as edges are added. We prove that the factor of 2 corresponds in a precise mathematical sense 
to a phase transition in the geometry of this set. Roughly, the set of fc-colorings looks like a giant ball for k > 2x, but 
like an error-correcting code for fc < (2 — e)\- We prove that a completely analogous phase transition also occurs 
both in random fc-SAT and in random hypergraph 2-coloring. And that for each problem, its location corresponds 
precisely with the point were all known polynomial-time algorithms fail. To prove our results we develop a general 
technique that allows us to prove rigorously much of the celebrated 1-step Replica-Symmetry-Breaking hypothesis 
of statistical physics for random CSPs. 
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1 Introduction 



For many random Constraint Satisfaction Problems (CSP), such as random graph coloring, random fc-SAT, 
random Max fc-SAT, and hypergraph 2-coloring, by now, we have asymptotically tight estimates for the 
largest constraint density for which typical instances have solutions (see [5]). At the same time, all known 
efficient algorithms for each problem fair very poorly, i.e., they stop finding solutions at constraint densities 
much lower than those for which we can prove that solutions exist. Adding insult to injury, the best known 
algorithm for each problem asymptotically fairs no better than certain extremely naive algorithms for the 
problem. 

For example, it has been known for nearly twenty years [10] that the following very simple algorithm 
will find a satisfying assignment of a random /c-CNF formula with m = rn clauses for r = 0(2 k /k): if 
there is a unit clause satisfy it; otherwise assign a random value to a random unassigned variable. While it 
is known that random /c-CNF remain satisfiable for r = 0(2 k ), no polynomial-time algorithm is known to 
find satisfying assignments for r = (2 k /k) ■ oj(k) for some function uo(k) — > oo. 

Similarly, for all k > 3, the following algorithm [18,2] will /c-color a random graph with average degree 
d < kink: select a random vertex with fewest available colors left and assign it a random available color. 
While it is known that random graphs remains /c-colorable for d ~ 2 k In k, no polynomial-time algorithm is 
known that can fc-color a random graph of average degree (1 + e)k In k for some fixed e > and arbitrarily 
large k. Equivalently, while it is trivial to color a random graph using twice as many colors as its chromatic 
number, no polynomial-time algorithm is known that can get by with (2 — e)\ colors, for some fixed e > 0. 

Random A;-SAT and random graph coloring are not alone. In fact, for nearly every random CSP of 
interest, the known results establish a completely analogous state of the art: 

1 . There is a trivial upper bound on the largest constraint density for which solutions exist. 

2. There is a non-constructive proof, usually via the second moment method, that the bound from (1) is 
essentially tight, i.e., that solutions do exist for densities nearly as high as the trivial upper bound. 

3. Some simple algorithm finds solutions up to a constraint density much below the one from (2). 

4. No polynomial-time algorithm is known to succeed for a density asymptotically greater than that in (3). 

In this paper we prove that this is not a coincidence. Namely, for random graph coloring, random A;-SAT, 
and random hypergraph 2-coloring, we prove that the point where all known algorithms stop is precisely 
the point where the geometry of the space of solutions undergoes a dramatic change. This is known as a 
"dynamical" phase transition in statistical physics and our results establish rigorously for random CSPs a 
large part of the "1-step Replica Symmetry Breaking" hypothesis [20]. Roughly speaking, this hypothesis 
asserts that while the set of solutions for low densities looks like a giant ball, at some critical point this ball 
shatters into exponentially many pieces that are far apart from one another and separated by huge "energy 
barriers". Algorithms (even extremely simple ones) have no problem finding solutions in the "ball" regime, 
but no algorithm is known that can find solutions in the "error-correcting code" regime. 

We believe that the presence of dynamical phase transitions in random CSPs is a very general phe- 
nomenon, whose qualitative characteristics should be problem-independent, i.e., universal. The fact that 
we can establish the exact same qualitative picture for a problem with binary constraints over k-wy vari- 
ables (random graph £>coloring) and a problem with fc-ary constraints over binary variables (hypergraph 
2-colorability) certainly lends support to this notion. That said, we wish to emphasize that determining for 
each random CSP the location of its dynamical phase transition (as we do in this paper for the three problems 
mentioned, in order to show that the transition coincides with the demise of all known algorithms) requires 
non-trivial, problem-specific ideas and computations. 



Perhaps the following is an intuitive model of how a dynamical phase transition comes about. In random 
graph coloring, rather than thinking of the number of available colors as fixed and the constraint density 
(number of edges) as increasing, imagine that we keep the constraint density fixed, but we keep decreasing 
the number of available colors. If we start with q available colors where q 3> x, it is reasonable to imagine 
that the set of valid g-colorings, viewed as a subset of {1,2,..., q} n , has a nice "round" shape, the rounder 
the greater q is relative to x- By the same token, when we restrict our attention to the set of those g-colorings 
that only use colors {1, 2, . . . , q — 1}, we are taking a "slice' of the set of ^-colorings. With each slicing 
the connectivity of the set at hands deteriorates, until at some point the set shatters. For example, slicing the 
2-dimensional unit sphere through the origin yields a circle, but slicing the circle, yields a pair of points. 

We conclude the introduction with a few words about the technical foundation for our work. To prove 
the existence (and determine the location) of a dynamical phase transition one needs access to statistical 
properties of the uniform measure over solutions. A geometric way of thinking about this is as follows. Given 
a CSP instance, say a /c-CNF formula with m clauses chosen uniformly at random, consider the function H 
on {0, l} n that assigns to each truth assignment the number of clauses it violates. In this manner, H defines 
a "landscape" in which satisfying assignments correspond to valleys at sea-level. Understanding statistical 
properties of the uniform measure over solutions amounts to understanding "the view" one enjoys from such 
a valley, a probabilistically formidable task. As we discuss in Section 4, we can establish the following: the 
number of solutions of a random CSP is sufficiently concentrated around its exponentially large expectation 
for the view from a random sea-level valley to be "the same" as the view from an "artificial" valley. That 
is, from the valley that results by first selecting a random a G {0, l} n and then forming a random /c-CNF 
formula, also with m clauses, but now chosen uniformly among the clauses satisfied by a, i.e., the view 
from the planted satisfying assignment in the planted model. This is a much easier view to understand and 
we believe that the "transfer" theorems we establish in this paper will significantly aid in the analysis of 
random CSPs. 

2 Statement of Results 

To present our results in a uniform manner we need to introduce some common notions. Let V be a set of 
n variables, all with the same domain D, and let C be an arbitrary set of constraints over the variables in 
V. A CSP instance is a subset of C. We let dist(<r, r) denote the Hamming distance between a, r G D n and 
we turn D n into a graph by saying that a, r are adjacent if dist(<r, r) = 1. For a given instance /, we let 
H = Hj : D n — > N be the function counting the number of constraints of I violated by each a G D n . 

Definition 1. The height of a path cfq, a%, . . . ,o~t £ D n is maxj H(o~i). We say that a G D n is a solution of 
an instance I, if H(a) = 0. We will denote by S{I) the set of all solutions of an instance I. The clusters of 
an instance I are the connected components ofS(I). A region is a non-empty union of clusters. 

Remark 1. The term cluster comes from physics. Requiring dist(er, r) = 1 to say that a, r are adjacent is 
somewhat arbitrary (but conceptually simplest) and a number of our results hold if one replaces 1 with o(n). 

We will be interested in distributions of CSP instances as the number of variables n grows. The set 
C = C n will typically consist of all possible constraints of a certain type, e.g., the set of all (?) possible 
hyperedges in the problem of 2-coloring random /c-uniform hypegraphs. We let I n ,m denote the set of all 
CSP instances with precisely m distinct constraints from C n and we letX njm denote the uniform distribution 
on the set of all instances I n ,m- We will say that a sequence of events £ n holds with high probability (w.h.p.) 
if limn^oo Pr[£ n ] = 1 and with uniformly positive probability (w.u.p.p.) if liminfn^oo Pr[£ n ] > 0. As per 
standard practice in the study of random structures, we will take the liberty of writing T n>m to denote the 
underlying random variable and, thus, write things like "The probability that 5(X njm )..." 



2.1 Shattering 



Definition 2. We say that the set of solutions ofI n ^ m shatters if there exist constants (3, 7, £, 6 > such 
that w.h.p. S(I ntTn ) can be partitioned into regions so that: 

1. The number of regions is at least e@ n . 

2. Each region contains at most an e _7n fraction of all solutions. 

3. The Hamming distance between any two regions is at least £n. 

4. Every path between vertices in distinct regions has height at least 9n. 

Our first main result asserts that the space of solutions for random graph coloring, random fc-SAT, and 
random hypergraph 2-colorability shatters and that this shattering occurs just above the largest density for 
which any polynomial-time algorithm is known to find solutions for the corresponding problem. Moreover, 
we prove that the space remains shattered until, essentially, the CSP's satisfiability threshold. More precisely: 

- A random graph with average degree d, i.e., m = dn/2, is w.h.p. fc-colorable for d < (2 — ^ k )klnk, 
where j k — > 0. The best poly-time fc-coloring algorithm w.h.p. fails for d > (1 + 8 k )k In k, where S k — > 0. 

Theorem 1. There exists a sequence e k — ► 0, such that the space of k-colorings of a random graph with 
average degree d shatters for all 

(1 + e k )k\nk < d < (2 - e k )klnk . (1) 

- A random k-CNF formula with n variables and rn clauses is w.h.p. satisfiable for r < 2 k In 2 — k. The 
best poly-time satisfiability algorithm w.h.p. fails for r > 2 k+l /k. In [23], non-rigorous, but mathematically 
sophisticated evidence is given that a different algorithm succeeds for r = 0((2 k /k) In k), but not higher. 

Theorem 2. There exists a sequence e k — > such that the space of satisfying assignments of a random 
k-CNE formula with rn clauses shatters for all 

(l + e fc )^ InA: <r < (l-e fc )2 fc ln2 . (2) 

- A random /c-uniform hypergraph with n variables and rn edges is w.h.p. 2-colorable for r < 2 k ~ l In 2 — |. 
The best poly-time 2-coloring algorithm w.h.p. fails for r > 2 k jk. In [23], non-rigorous, but mathematically 
sophisticated evidence is given that a different algorithm succeeds for r = 0((2 k /k) In A;), but not higher. 

Theorem 3. There exists a sequence e k — > such that the space of 2-colorings of a random k-uniform 
hypergraph with rn edges shatters for all 

2 k ~ 1 

(1 + e k ) — In k < r < (1 - e k )2 k ^ In 2 . (3) 

Remark 2. As the notation in Theorems 1,2,3 is asymptotic in k, the stated intervals may be empty for small 
values of k. In this extended abstract we have not optimized the proofs to deliver the smallest values of k for 
which the intervals are non-empty. Quick calculations suggest k > 6 for hypergraph 2-colorability, k > 8 
for fc-SAT, and k > 20 for /c-coloring. 



2.2 Rigidity 

The regions mentioned in Theorems 1, 2 and 3 can be thought of as forming an error-correcting code in 
the solution-space of each problem. To make this precise we need to introduce the following definition and 
formalize the notion of "a random solution of a random instance". 

Definition 3. Given an instance I, a solution a G <S(I) and a variable v G V, we say that v in (I, a): 

— Is f{n)-rigid, if every r G S(I) such that t(v) ^ o~{v) has dist{a, r) > f(n). 

— Is f{n)-loose, if for every j G D, there exists t G S(I) such that t(v) = j and dist{a, r) < f(n). 

We will prove that while before the phase transition, in a typical solution, every variable is loose, after 
the phase transition nearly every variable is rigid. To formalize the notion of a random/typical solution, recall 
that I n>m denotes the set of all instances with m constraints over n variables and let A = A riim denote the 
set of all instance-solution pairs, i.e., A Thrn = {(I, a) : I G I n ,-m, & G We let U = U n)m be the 

probability distribution induced on A n ^ m by the following: 

Choose an instance / G I n ,m uniformly at random. 
If SiT) ^ 0, select a G S(T) uniformly at random. 

We will refer to instance-solution pairs generated according to U n ^ m as uniform instance-solution pairs. We 
note that although the definition of uniform pairs allows for S(I) to be typically empty, i.e., to be in the 
typically unsatisfiable regime, we will only employ the definition for constraint densities such that w.h.p. 
S{I) contains exponentially many solutions. Hence, our liberty in also using the term a "typical" solution. 

Theorem 4. Let (I, a) be a uniform instance-solution pair where: 

— I is a graph with dn/2 edges, where d is as in (1), and a is a k-coloring of I, or, 

— I is a k-CNF formula with rn clauses, where r is as in (2), and a is a satisfying assignment of I, or, 

— I is a k-uniform hypergraph with rn edges, where r is as in (3), and a is a 2-coloring of I. 

W.h.p. the number of rigid variables in (I, a) is at least Jk n > fo r some sequence 7^ — ► 1. 

Remark 3. Theorem 4 is tight since for every finite constraint density, a random instance w.h.p. has I7(n) 
variables that are not bound by any constraint. 

The picture drawn by Theorem 4, whereby nearly all variables are rigid in typical solutions above the 
dynamical phase transition, is in sharp contrast with our results for densities below the transition for graph 
coloring and hypergraph 2-colorability. While we believe that an analogous picture holds for A;-SAT, see 
Conjecture 1, for technical reasons we cannot establish this presently. (We discuss the additional difficulties 
imposed by random /c-SAT in Section 4.) 

Theorem 5. Let (I, a) be a uniform instance-solution pair where: 

— I is a graph with dn/2 edges, where d < (1 — ej^jk \n.k, and a is a k-coloring of I, or, 

— I is a k-uniform hypergraph with rn edges, where r < (1 — e&) (2 k ~ 1 /k) In k, and a is a 2-coloring of I. 

There exists a sequence — > such that w.h.p. every variable in (I, a) is o{n)-loose. 

We note that in fact, for all d and r as in Theorem 5, w.u.p.p. (I, a) is such that changing the color of 
any vertex to any color only requires changing the color of 0(log n) other vertices. 

Conjecture I. Let (/, a) be a uniform instance-solution pair where / is a /c-CNF formula with rn clauses, 
where r < (1 — ef i )(2 k /k) In k, and a is a satisfying assignment of /. There exists a sequence e& — ► such 
that w.h.p. every variable in (I, a) is o(n)-loose. 



3 Background and Related Work 



3.1 Algorithms 

Attempts for a "quick improvement" upon either of the naive algorithms mentioned in the introduction for 
satisfiability/graph coloring, stumble upon the following general fact. Given a CSP instance, consider the 
bipartite graph in which every variable is adjacent to precisely those constraints in which it appears, known 
as the factor graph of the instance. For random formulas/graphs, factor graphs are locally tree-like, i.e., for 
any arbitrarily large constant D, the depth- D neighborhood of a random vertex is a tree w.h.p. In other words, 
locally, random CSPs are trivial, e.g., random graphs of any finite average degree are locally 2-colorable. 
Moreover, as the constraint density is increased, the factor graphs of random CSPs get closer and closer to 
being biregular, so that degree information is not useful either. Combined, these two facts render all known 
algorithms impotent, i.e., as the density is increased, their asymptotic performance matches that of trivial 
algorithms. 

In [22], Mezard, Parisi, and Zecchina proposed a new satisfiability algorithm called Survey Propagation 
(SP) which performs extremely well experimentally on instances of random 3-SAT. This was very surprising 
at the time and allowed for optimism that, perhaps, random k-S AT instances might not be so hard. Moreover, 
SP was generalized to other problems, e.g., fc-coloring [9] and Max fc-SAT [8]. An experimental evaluation 
of SP for values of k even as small as 5 or 6 is already somewhat problematic, but to the extent it is reliable 
it strongly suggests that SP does not find solutions for densities as high as those for which solutions are 
known to exist. Perhaps more importantly, it can be shown that for densities at least as high as 2 k In 2 — k, 
if SP can succeed at its main task (approximating the marginal probability distribution of the variables with 
respect to the uniform measure over satisfying assignments), so can a much simpler algorithm, namely 
Belief Propagation (BP), i.e., dynamic programming on trees. 

The trouble is that to use either BP or SP to find satisfying assignments one sets variables iteratively. 
So, even if it is possible to compute approximately correct marginals at the beginning of the execution (for 
the entire formula), this can stop being the case after some variables are set. Concretely, in [23], Montanari 
et al. showed that (even within the relatively generous assumptions of statistical physics computations) the 
following Gibbs-sampling algorithm fails above the (2 k /k) In k barrier, i.e., step 2 below fails to converge 
after only a small fraction of all variables have been assigned a value: 

1. Select a variable v at random. 

2. Compute the marginal distribution of v using Belief Propagation. 

3. Set v to {0, 1} according to the computed marginal distribution; simplify the formula; go to step 1. 

3.2 Relating the Uniform and the Planted Model. 

The idea of deterministically embedding a property inside a random structure is very old and, in general, 
the process of doing this is referred to as "planting" the property. In our case, we plant a solution a in a 
random CSP, by only including constraints compatible with a. Juels and Peinado [19] were perhaps the 
first to explore the relationship between the planted and the uniform model and they did so for the clique 
problem in dense random graphs G n i/ 2 , i.e., where each edge appears independently with probability 1/2. 
They showed the distribution resulting from first choosing G = G n ^/2 and then planting a clique of size 
(1 + e) log 2 n is very close to G njl / 2 and suggested this as a scheme to obtain a one-way-function. Since 
the planted clique has size only (1 + e) log 2 n, the basic argument in [19] is closely related to subgraph 
counting. In contrast, the objects under consideration in our work (fc-colorings, satisfying assignments, etc.) 



have an immediate impact on the global structure of the combinatorial object being considered, rather than 
just being local features, such as a clique on 0(log n) vertices. 

Coja-Oghlan, Krivelevich, and Vilenchik [12, 13] proved that for constraint densities well above the 
threshold for the existence of solutions, the planted model for /c-coloring and fc-SAT is equivalent to the 
uniform distribution conditional on the (exponentially unlikely) existence of at least one solution. In this 
conditional distribution as well as in the high-density planted model, the geometry of the solution space is 
very simple, as there is precisely one cluster of solutions. 

3.3 Solution-space Geometry 

In [7,21] the first steps were made towards understanding the solution-space geometry of random /c-CNF 
formulas by proving the existence of shattering and the presence of rigid variables for r = 0{2 k ). This was 
a far cry from the true r ~ (2 fc /k) In A; threshold for the onset of both phenomena, as we establish here. 
Besides the quantitative aspect, there is also a fundamentally important difference in the methods employed 
in [7,21] vs. those employed here. In those works, properties were established by taking a union bound 
over all satisfying assignments. It is not hard to show that the derived results are best possible using those 
methods and, in fact, there is good reason to believe that the results are genuinely tight, i.e., that for densities 
o(2 k ) the derived properties simply do not hold for all satisfying assignments. Here, we instead establish 
a systematic connection between the planted model and the process of sampling a random solution of a 
random instance. This argument allows us to analyze "typical" solutions while allowing for the possibility 
that a (relatively small, though exponential) number of "atypical" solutions exist. Therefore, we are for the 
first time in a position to analyze the extremely complex energy landscape of below-threshold instances of 
random CSPs, and to estimate quantities that appeared completely out of reach prior to this work. 

4 Our Point of Departure: Symmetry, Randomness and Inversion 

As mentioned, the results in this paper are enabled by a set of technical lemmas that allow one to reduce 
the study of "random solutions of random CSP instances" to the study of "planted CSP solutions". The 
conceptual origin of these lemmas can be traced to the following humble observation. 

Let M be an arbitrary 0-1 matrix with the property that all its rows have the same number of Is and all 
its columns have the same the number of Is. A moment's reflection makes it clear that for such a matrix, 
both of the following methods select a uniformly random 1 from the entire matrix: 

1. Select a uniformly random column and then a uniformly random 1 in that column. 

2. Select a uniformly random row and then a uniformly random 1 in that row. 

An example of how we employ this fact for random CSPs is as follows. Let T be the set of all /c-CNF 
formulas with n variables and m distinct clauses (chosen among all 2 fe (^) possible £>clauses). Say that 
a G {0, l} n NAE-satisfies a formula F G T if under a, every clause of F has at least one satisfied and at 
least one falsified literal. Let M be the 2 n x \F\ matrix whereM^ = 1 iff a G {0, l} n NAE-satisfies F. By 
the symmetry of T , it is clear that all rows of M have the same number of Is. Imagine, for a moment, that 
the same was true for all columns. Then, a uniformly random solution of a uniformly random instance would 
be distributed exactly as a "planted" instance-solution pair: first select a G {0, l} n uniformly at random; 
then select m distinct clauses uniformly at random among all 2 k ~ l (^) clauses NAE-satisfied by a. 

Our contribution begins with the realization that exact row- and column-balance is not necessary. Rather, 
it is enough for the Is in M to be "well-spread". More precisely, it is enough that the marginal distributions 



induced on the rows and columns of M by selecting a uniformly random 1 from the entire matrix are both 
"reasonably close to" uniform. For example, assume we can prove that Q{\ F\) columns of M have Q(f(n)) 
Is, where f(n) is the average number of Is per column. Indeed, this is precisely the kind of property implied 
by the success of the second moment method for random NAE-fc-SAT [3]. Under this assumption, proving 
that a property holds w.u.p.p. for a uniformly random solution of a uniformly random instance, reduces to 
proving that it holds w.h.p. for the planted solution of a planted instance, a dramatically simpler task. 

There is a geometric intuition behind our transfer theorems which is more conveniently described when 
every constraint is included independently with the same probability p, i.e., we take p = m/ (2 fe (^)). For 
all k > 3 and m = rn, it was shown in [3] that the resulting instances w.u.p.p. have exponentially many 
solutions for r < 2 k ~ l In 2 — 3/2. Consider now the following way of generating planted NAE fc-SAT 
instances. First, select a formula F by including each clause with probability p, exactly as above. Then, 
select a £ {0, 1}" uniformly at random and remove from F all constraints violated by a. Call the resulting 
instance F' . Our results say that as long as q = r(l — 2~ k+1 ) < 2 k ~ 1 In 2 — 3/2, the instance F' is "nearly 
indistinguishable" from a uniform instance created by including each clause with probability q. (We will 
make this statement precise shortly.) 

To see how this happens, recall the function H : a — > N counting the number of violated constraints 
under each assignment. Clearly, selecting F specifies such a function Hp, while selecting a £ {0, l} n and 
removing all constraints violated by a amounts to modifying Hp so that Hp (a) = 0. One can imagine that 
such a modification creates a gradient in the vicinity of a, a "crater" with a at its bottom. What we prove is 
that as long as Hp already had an exponential number of craters and the number of craters is concentrated, 
adding one more crater does not make a big difference. Of course, if the density is increased further, the 
opened crater becomes increasingly obvious, as it takes a larger and larger cone to get from the typical 
values of Hp down to 0. Hence the ease with which algorithms solve planted instances of high density. 

To prove our transfer theorems we instantiate this idea for random graph fc-coloring, random /c-uniform 
hypergraph 2-coloring, and random /c-SAT. For this, a crucial step is deriving a lower bound on the number 
of solutions of a random instance. For example, in the case of random graph /c-coloring, we prove that the 
number of fc-colorings, |5(/ nim )|, for a random graph with n vertices and m edges is "concentrated" around 
its expectation in the sense that w.h.p. 

n" 1 |ln|5(/ n , m )| -lnE(|5(/ n , m )|)| = o(l) . (4) 

To prove this, we use the upper bound on the second moment E [|5(I njm )| 2 ] from [4] to show that w.u.p.p. 
\S(I n ,m)\ — ^(E|5(/ n m )|). Then, we perform a sharp threshold analysis, using theorems of Friedgut [15], 
to prove that (4) holds, in fact, with high probability. A similar approach applies to hypegraph 2-coloring. 

The situation for random fc-SAT is more involved. Indeed, we can prove that the number of satisfying 
assignments is not concentrated around its expectation in the sense of (4). This problem is mirrored by 
the fact that the second moment of the number of satisfying assignments exceeds the square of the first 
moment by an exponential factor (for any constraint density). Nonetheless, letting Fk(n, rn) denote a uni- 
formly random /c-CNF formula with n variables and m clauses, combining techniques from [6] with a sharp 
threshold analysis, we can derive a lower bound on the number of satisfying assignments that holds w.h.p., 
namely n _1 In \S{F^{n, m))\ > n~ l lnE|5(Ffc(n, m))\ — 4>{k), where <p(k) — > exponentially with k. 
This estimates allows us to approximate the uniform model by the planted model sufficiently well in order 
to establish Theorems 2 and 4. 



5 Proof sketches 



Due to the space constraints, in the remaining pages we give proof sketches of our results for fc-coloring, 
to offer a feel of the transfer theorems and of the style of the arguments one has to employ given those the- 
orems (actual proofs appear in the Appendix). The proofs for hypergraph 2-coloring are relatively similar, 
as it is also a "symmetric" CSP and the second moment methods works on its number of solutions. For 
fc-SAT, though, a significant amount of additional work is needed, as properties must be established with ex- 
ponentially small error probability to overcome the large deviations in the number of satisfying assignments 
(proofs appear in the Appendix). 

5.1 Transfer Theorem for Random Graph Coloring 

We consider a fixed number e > and assume that k > ko for some sufficiently large ko = ko(e). We 
denote {1, . . . , k} as [k]. We are interested in the probability distribution U Thm on A ra m resulting from first 
choosing a random graph G = G(n, m) and then a random /c-coloring of G (if one exists). To analyze this 
distribution, we consider the distribution V n ,m on A niin induced by following experiment. 

PI. Generate a uniformly random ^-partition a £ [k] n . 

P2. Generate a graph G with m edges chosen uniformly at random among the edges bicolored under a. 
P3. Output the pair (G, a). 

The distribution V n>m is known as the planted model. 

Theorem 6. Suppose that d = 2m /n < (2 — e)klnk. There exists a Junction f(n) = o{n) such that the 
following is true. Let T> he any graph property such that G(n, m) has T> with probability 1 — o(l), and let 
£ be any property of pairs [G, a) £ A n ^ m . If for all sufficiently large n 

p rPn, m [(G, a) has £\G has V] > 1 - exp(-/(n)), (5) 

then Pr Wn m [(G, a) has £ ] = 1 - o(l). 

5.2 Loose Variables Below the Transition 

Suppose that d < (1 — e)k In k. Recall that a graph with vertex set V is said to be (-choosable if for any 
assignments of color lists of length at least £ to the elements of V, there is a proper coloring in which every 
vertex receives a color from its list. To prove Theorem 5, we consider the property £ that all vertices are 
o(n)-loose and the following condition V: 

For any set S C V of size |5| < g(n) the subgraph induced on S is 4-choosable. 

Here gin) is some function such that f{n) <C g{n) = o(n), where f{n) is the function from Theorem 6. A 
standard argument shows that a random graph G(n, m), where m = 0{n), satisfies V w.h.p. 

By Theorem 6, we are thus left to establish (5). Let a € [k] n be a uniformly random k -partition, and let 
G be a random graph with m edges such that a is a /c-coloring of G. Since a is uniformly random, we may 
assume that the color classes Vi = cr~ l (i) satisfy \Vi\ ~ n/k. Let v o G V be any vertex, and let / ^ o~(vq) 
be the "target color" for vq. Our goal is to find a coloring r such that t{vq) = I and dist(<r, r) < g(n). 

If v has no neighbor in Vi, then we can just assign this color to vq. Otherwise, we run the following 
process. In the course of the process, every vertex is either awake, dead, or asleep. Initially, all the neighbors 
of vq in Vi are awake, v is dead, and all other vertices are asleep. In each step of the process, pick an awake 



vertex w arbitrarily and declare it dead (if there is no awake vertex, terminate the process). If there are 
at least five colors c\(w), . . . , c^{w) available such that w has no neighbor in V c .^ w y then we do nothing. 
Otherwise, we pick five colors c\{w), . . . , c${w) randomly and declare all asleep neighbors of w in V Cj ^ 
awake for 1 < j < 5. 

Lemma 1. With probability at least 1 — exp(— f(n)) there are at most g(n) dead vertices when the process 
terminates. 

The proof of Lemma 1 is based on relating our process to a subcritical branching process. The basic insight 
here is that when d < (1 — e)k In k it is very likely that a vertex w has five immediately available colors. 
More precisely, for any w the number of neighbors in any class Vi with i ^ a(w) is asymptotically Poisson 
with mean (1 + o(l)) h^\ n < (1 — £ + o(l)) ■ Hence, the probability that w does not have a neighbor 
in Vi is about k 6 ^ 1 . As there are k colors in total, we expect about (k — l) e colors available for w, i.e., a lot. 

To obtain a new coloring r in which vo takes color I we consider the set D of all dead vertices. We let 
t(u) = <j{u) for all u G V \ D. Moreover, conditioning on the event V, we can assign to each w G D 
a color from the list {c\(w), . . . , 05(10)} \ {/}. Thus, the new coloring r differs from a on at most \D\ < 
g(n) = o(n) vertices. 

5.3 Rigid Variables Above the Transition 

Suppose that d > (1 + e)klnk. To prove Theorem 4 for coloring we apply Theorem 7 as follows. We let 
a, j3 > be sufficiently small numbers and denote by 6 the following property of a pair (G, a) G A n<m \ 

There is a subgraph G* C G of size | V(G*)| > (1 — a)n such that for every vertex v of G* and 
each color i / a(v) there are at least (3 In k vertices vertex w in G* that are adjacent to v such that (6) 

a{w) = i. 

Also, we let V be the property that the maximum degree is at most (In n) 2 . 

We shall prove that for a pair (G, a) chosen from U n>m a subgraph G* as in (6) exists w.h.p. If that is 
so, then every vertex in G* has at least one neighbor in every color class other than its own. Therefore, it 
is impossible to just assign a different color to any vertex in G*. In fact, since all vertices in G* have a lot 
(namely, at least /3 In k) of neighbors with every other color, the expansion properties of the random graph 
G(n,m) imply that recoloring any vertex v in G* necessitates the recoloring of at least n/(klnk) further 
vertices. Loosely speaking, the conflicts resulting from recoloring v spread so rapidly that we necessarily 
end up recoloring a huge number of vertices. Thus, all vertices in G* are n/(k In /c)-rigid. Note that we can 
not hope for much better, as we can always recolor v by swapping two color classes, i.e., ~ 2n/k vertices. 

To prove the existence of the subgraph G*, we establish the following. 

Lemma 2. Condition (5) holds for D and £ as above. 

To obtain Lemma 2, let (G, a) G A n . m be a random pair chosen from the distribution V n ,m- We may 
assume that |cr _1 (i)| ~ n/k for all i. To obtain the graph G*, we perform a "stripping process". As a first 
step, we obtain a subgraph H by removing from G all vertices that have fewer than 7 In A; neighbors in any 
color class other than their own. If 7 = 7(e) is sufficiently small, then the expected number of vertices 
removed in this way is less than nk~ s for a 5 > 0, because for each vertex w the expected number of 
neighbors in another color class is bigger than (1+e) In k. Then, we keep removing vertices from H that have 
"a lot" of neighbors outside of H. Given the event V, we then show that with probabiltiy 1 — exp(— Q(n)) 
the final result of this process is a subgraph G* that satisfies (6). 



5.4 Proof of Theorem 1 



Theorem 1 concerns the "view" from a random coloring a of G(n, m). Basically, our goal is to show that 
only a tiny fraction of all possible colorings are "visible" from a, i.e., a lives in a small, isolated valley. To 
establish the theorem, we need a way to measure how "close" two colorings a, r are. The Hamming distance 
is inappropriate here because two colorings a, r can be at Hamming distance n, although r simply results 
from permuting the color classes of a, i.e., although a and r are essentially identical. Instead, we shall use 
the following concept. Given two coloring a, r, we let M CT|T = (Ma, T )i<i j<k De the matrix with entries 

To measure how close r is to a we let 

k 

Ur) = \\M^ T \\ 2 F =Y,(M l J,r) 2 , 

be the squared Frobenius norm of AI aT . Observe that this quantity reflects the probability that a single 
random edge is monochromatic under both a and r, i.e., the conelation of a and r, precisely as desired. 
Hence, f a is a map from the set [k] n of A: -partitions to the interval [k~ 2 , / CT (<r)] , where f a (a) > k~ l . Thus, 
the larger / CT (r), the more r resembles a. Furthermore, for a fixed a G S(G) and a number A > we let 

9*,G,x{x) = \{r G [k] n : / CT (r) = x A H(r) < Xn}\. 

In order to show that S{G n ^ m ) with m = rn decomposes into exponentially many regions, we employ 
the following lemma. 

Lemma 3. Suppose that r > (i + £k)k In k. There are numbers k~ 2 < y\ < y2 < k" 1 and A, 7 > such 
that with high probability a pair (G, a) G A n>m chosen from the distributoin lA n)m has the following two 
properties. 

1. For all x G [2/1,2/2] we have g a ,G,x( x ) — 0. 

2. The number of colorings r G S(G) such that / CT (r) > 1/2 is at most exp(-jn) ■ \S(G)\. 

Let G = G n! m be a random graph and call a G S(G) good if both (1) and (2) hold. Then Lemma 3 states 
that w.h.p. a 1 — o(l)-fraction of all a G S(G) are good. Hence, to decompose S(G) into regions, we 
proceed as follows. For each a G S(G) we let C a = {t G S(G) : /o-(t) > 2/2}- Then starting with the set 
S = S(G) and removing iteratively some C a for a good a G S yields an exponential number of regions. 
Furthermore, each such region C a is separated by a linear Hamming distance from the set S(G) \C a , because 
f a is "continuous" with respect to n" 1 x Hamming distance. Thus, Theorem 1 follows from Lemma 3. 
Finally, by Theorem 6, to prove Lemma 3 it is sufficient to show the following. 

Lemma 4. Suppose that r > (i + £k)k In k. There are k~ 2 < y\ < 2/2 < k" 1 and A, 7 > such that with 
probability at least 1 — exp(— Q{n)) a pair (G, a) G A n _ m chosen from the distributoin V n .m has the two 
properties stated in Lemma 3. 

The proof of Lemma 4 is based on the "first moment method". That is, for any k~ 2 < y < k^ 1 we compute 
the expected number of assignments r G [k] n such that / CT (r) = y and H{t) < An. This computation is 
feasible in the planted model and yields similar expressions as encountered in [4] in the course of computing 
the second moment of the number of fc-colorings. Therefore, we can show that the expected number of such 
assignments r is exponentially small for a regime yi < y < 2/2. whence Lemma 21 follows from Markov's 
inequality. 
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In this appendix we present the proofs of our results for graph coloring and random fc-SAT, thereby 
presenting the most important techniques. We omit the proofs for hypergraph 2-coloring, as these are similar 
to but simpler than the A;-SAT proofs, due to the fact that the transfer theorem for hypergraph 2-coloring is 
as strong as that for fc-coloring. We generally assume that n is sufficiently large. 

A Graph coloring 
A.l The planted model 

In this section we consider a fixed number e > and assume that k > ko for some sufficiently large 
ko = ko(e). We are interested in the probability distribution U n>m on A n ^ m . To analyze this distribution, we 
consider the distribution V n ,m on Ai,m induced by following experiment ("planted model"). 

PI. Generate a uniformly random ^-partition a £ [k] n . 

P2. Generate a graph G with m edges chosen uniformly at random among the edges bicolored under a. 
P3. Output the pair (G, a). 

Theorem 7. Suppose that d = 2m /n < (2 — e)klnk. There exists a function f(n) = o(n) such that the 
following is true. Let T> be any graph property such that G(n, m) has T> with probability 1 — o(l), and let 
£ be any property of pairs (G, a) G A n>m . If for all sufficiently large n we have 

Prp„, m [{G, a) has £\G has V] > 1 - exp(-/(n)), (7) 

then Pr Wn m [(G, a) has £ ] = 1 — o(l). 

For a given assignment a G [k] n we let G(a) be the set of all graphs with m edges for which a is a 
proper coloring. Then it is immediate that 

El<Kj<* W-Hi)\\ = ((n 2 - Ell k- X «| 2 )/2\ _ 
m J \ m J 

Hence 

A = m ax| G W|<p-WQY 
v \ m J 

Lemma 5. There is a constant p > such that the following is true. Let a € [k] n be chosen uniformly at 
random. Then Pr [|G(c)l — pM — P- 

Proof. Let7 > be sufficiently small. Moreover, let rii = |<7 _1 (i)|, 8% = rij — n/k, and N = (1 — k~ l )(^j. 
Then J2i $i = 0- Therefore, 

i i 

Since for a random a the numbers ni are multinomially distributed, with probability we have |n.,— ?| < 
ypynjk. Hence, letting N(a) = Yl%<j n i n j> we conclude that there is a constant p > such that 

Pr [N(a) > N - jn] > p. 




Thus, by Stirling's formula with probability at least p we have 

m J\mJ ~2 \ N J \l-m/N(a)J v ' ' 

Since N = Q{n 2 ) and m = 0{n), in the case N(a) > N — 771 we have 

N W\ > (1 - in /N) m > 
>i, 



N 

1 - m/iV 



1 - m/N(a) 
(l-m/N) N ^~ N > f2(l). 

Hence, choosing p > sufficiently small, we can ensure that Pr [|G(cr)| > pX] > p. □ 

Corollary 1. We have |Ai,m| > p 2 k n X. 

Lemma 6. Suppose that there is a number ( > smc/z ?/zctf Pr-p n m [£\C] < exp(— £n). Then 

|{(G, a) G A n>m n £ : G G C}| < p~ 2 exp(-Cn)K,m|- 

Proof. We have 

Prp„, m [f AC] 



exp(-Cn) > Pr P „ im [£|C] 



Prp n , m [C] 

\{GeG(a) : (G, <r) g£aGgC)}| 



\G(a)\ 

erg [ft] 

> A-^^Pr^^ \{(G, a) G vl re ,™ n £ : G G C)}| , 
because |G(cr)| < A for all a. Hence, Corollary 1 yields 

exp(-Cn) > yf— • |{(G, a) G A n>m n £ : G G C)}\ , 

as desired. □ 

Let /x = E(|5(G njm )|) be the expected number of fc-colorings of G n m . Combining the second moment 
argument from [4] with arguments from [1], we obtain the following result (see Appendix A.2). 

Lemma 7. There is a function f(n) = o(n) such that \S(G n ^ m )\ > exp(—f(n))p with high probability. 

Proof of Theorem 7. Assume that a random pair (G, a) chosen according to the uniform model has £ with 
probability at least 2£, while Prp n m [£\C] < exp(— Qn) for an arbitrarily small ( > 0. Since G n>m has the 
property C w.h.p., we conclude that 

Pr Wn , m [£\C] > (. 

Therefore, Lemma 7 entails that 

& = |{(G, ff )£4 m n^:GeC}| >Cexp(-/(n))|Avn|- 
Since /(n) = o(n), this contradicts Lemma 6. □ 



A.2 Proof of Lemma 7 



To prove Lemma 7, we combine the second moment argument from [4] with a sharp threshold argument. 
Let G = G(n, m) be a random graph and let X be the number of balanced colorings of G, i.e., colorings 
a € [k] n such that |cr~ 1 (i) — n/k\ < 1 for all 1 < i < k. Recall that S(G) denotes the set of all A;-colorings 
of G. A direct computation involving Stirling's formula shows that 

E(X) > Q{n^ k ' 2 )E\S{G)\. 

In addition, [4, Section 3] shows that there is a constant C = C(k) such that 

E(X 2 ) < C(k)E(X) 2 . 

Applying the Paley-Zigmund inequality, we thus conclude that there is a number a = a(k) > such that 

Pr [X > aE(X)] > a, 

whence 



Pr 



> a. 



\S(G)\ > Q{n- k / 2 )E\S{G)\ 
In addition, E|5(G)| is easily computed: we have 

n -1 lnE|<S(G)| = lnfc + rln(l - k~ l ) + o(l), 
where r = m/n. Thus, we obtain the following. 

Lemma 8. Let £(k, r) = In k + r ln(l - k' 1 ). Then Pr [n~ l In \S(G(n, m))\ > £(k, r) - o(l)] > a. 

To complete the proof of Lemma 7, we combine Lemma 8 with a shaip threshold result. Let A% be the 
property that a graph G on n vertices has less than £ n fc-colorings. 

Lemma 9. For any fixed £ > the property A% has a sharp threshold. That is, there is a sequence r n such 
that for any e > we have 

lim Pr \G(n, (1 — e) \r n n] ) does not have A] = 1 — lim Pr [G(n, (1 + e) \r n n] ) has A] = 0. 

n— >oo n— >oo 

We shall prove Lemma 9 in Appendix A.3. Lemma 7 is an immediate consequence of Lemmas 8 and 9. 



A.3 Proof of Lemma 9 

The property A^ is monotone under the addition of edges. Therefore, it is sufficient to prove that A^ has a 
sharp threshold in the random graph G(n,p), in which edges are added with probability p independently. 
Let J\f = £ n . The argument builds upon [1]. We denote the set of /c-colorings of a graph G by S(G). 

Lemma 10. Let M 1 be some number ( that may depend on n ), and let < t < 1 be fixed. Further, let 
M = 0(1) be an integer. Suppose thatPr[\S(G(n,p))\ <M'\ < 1 — r. Moreover, assume that for a list of 
colors ci, . . . , cm £ {1, ■ ■ ■ ,k} the following is true: if we pick M vertices v\, . . . , vm independently and 
uniformly at random, then with probability > l — t/2 the random graph G(n,p) has at most N' k-colorings 
in which Vi receives a color different from Cifor all 1 < i < M. Then with probability > 1 — t/2 + o(l) 
the random graph G(n,p) has at most M' k-colorings in which Vi receives a color different from c^for all 
1< i < M - 1. 



Proof. Let a; be a (very) slow growing function of n. Moreover, let £ j be the event that the first i constraints: 
u Vj must not receive color cf for 1 < j < i, cause the number of /c-colorings to be at most J\f' . Then given 
that G = G(n,p) has more than J\f' A;-colorings (i.e., coditional on the event £q), the probability of £m is at 
least \. Hence, conditional on £q, we have 

Pr [S M -i] + Pr [£m\£m-i] Pr [£ M -l] > \- 

We consider two cases: if Pr [£ M -i] > \ ~ then we are done. Hence, assume that Pr [£ m—i] < 
^ — u~ l . Then Pr [<?a/|£a/_i] > u~ l . Note that Pr [£m \£~m-i] is the fraction of vertices to such that 
forbidding color cm at w reduces the number of colorings to at most M' (after the addition of the first M — I 
constraints). We call such vertices w good, and denote the set of colorings that w spoils by Z w and its size 
by z w = \Z W \. Let us consider two cases. Let y be the number of colorings of G that respect the first M — 1 
constraints. We consider two cases. In each of these two cases we shall prove that adding a small number of 
random edges to G(n,p) reduces the number of colorings that respect the first M — 1 constraints to at most 
A/ 7 with probability at least 1 — w~ 3 . 

Case 1: y < luN'. Since for each of the y colorings the probability that a new random edge spoils this 
coloring is fc -1 , we can reduce the number of colorings to at most Af' by adding u random edges (use 
Markov's inequality). 

Case 2: y > loM' . If w, w' are good, then \ Z W n Z w /\ > y — 2J\f'. Therefore, adding an edge between two 
good vertices causes the number of colorings to drop to at most 2J\f'. Furthermore, the probability that 
a random edge joins two good vertices is Pr [£m \£~m-i] 2 > oj~ 2 . Therefore, after adding a; 10 edges, 
we have reduced the number of proper colorings to at most 2J\f' with probability > 1 — w" 4 . Finally, 
adding an additional w 10 edges reduces the number of colorings to at most J\f' by the same argument as 
in Case 1. 

Now, note that instead of first imposing the M — 1 constraints w\, . . . , wm-i and then adding the random 
edges as in Cases 1 and 2 we could first add a set of 2uj 10 random edges to G(n,p). As uj 10 is of smaller 
order than the standard deviation of the number of edges of G(n,p), the resulting distribution is within 
o(l) from the originial distribution G(n,p) in total variation distance. Therefore, we conclude that actually 
just imposing the M — 1 constraints wi,. . . ,wm-i suffices to increase the probability of having < J\f' 
/c-colorings to 1 — r/2 + o(l). □ 

Corollary 2. LetM 1 be some number {may depend on n), and let < t < 1 be fixed. Further, let M = O(l) 
be an integer. Suppose that Pr [\S(G(n,p))\ < M'] < 1 — r. Then there is no list of colors c±, . . . , cm £ 
{1, . . . , k} such that the following is true: if we pick M vertices v\, . . . ,vm independently and uniformly 
at random, then with probability > 1 — t/2 the random graph G(n,p) has at most M' k-colorings in which 
Vi receives a color different from Cifor all 1 < i < M. 

Proof. Applying the lemma M times, we can reduce the number of constraints that is necessary to reduce 
the number of colorings to < M 1 to 0. □ 

To prove that has a sharp threshold, we assume for contratiction that this is not so. Hence, there exists 
an edge probability p* such that the probability that G niP * has ^4^ is exactly equal to 1 — t for a small t > 0. 
Further, by [15, Theorem 2.4] there exists a fixed graph R on r vertices such that with probability > 1 — t/3 
the following is true. If we first pick G = G riiP * and then insert a random copy of R into G, then the resulting 
graph has A%. Furthermore, this graph R is /c-colorable. In fact, by monotonicity we may assume that R is 



uniquely fc-colorable. The experiment of inserting a random copy of R into G n ^ p * is actually equivalent to 
the following (because G„ iP * is symmetric with respect to vertex permutations). We let Gr denote a random 
graph obtained by first inserting a copy of R into the first r vertices v±, . . . ,v r , and then adding edges with 
probability p* independently (among all n vertices v%, . . . , v n ). Then the probability that Gr has A% is at 
least 1 — 2/3. Hence, 

Pv[\S(G R )\ <M] > 1-2/3, 



while by the choice of p* 



Pr [\S(G n , p *)\ > Af] > t > 0. 



(8) 



(9) 



Let G signify the subraph of Gr induced on the n — r vertices u r +i, 
implies that 



Pr 



\S(6)\ > k~ r M 



> t. 



, v n . Then G = G n -r,p*, and (9) 

(10) 



Furthermore, we can relate the £>colorings of Gr and the A;-colorings of G as follows. Let Q be the set of 
edges from the set {v±, . . . , v r } to {tv+i, • • • , v n }. Then w.h.p. \Q\ = 0(1) and no vertex in {v r+ i, . . . , v n } 
is incident to more than one edge in Q. Furthermore, since R admits a unique A;-coloring, each edge in Q for- 
bids its endpoint in {iv+i, • • • , v n } exactly one color. Hence, the edges in Q impose constraints c±, . . . , cm 
on M = \Q\ randomly chosen vertices wi, . . . , wm as in Lemma 10. Therefore, (8) implies that 



Pr 



G + M random constraints has at most J\f /c-colorings > 1 — 2/3. 



But then Corollary 2 implies that 



Pr 



\S(G)\ < M > 1-2/3. 



Furthermore, as we may add another In n random edges to G without shifting the distribution by more than 
o(l) in total variation distance, and since each of these In n edges reduces the expected number of colorings 
by A;" 1 , Markov's inequality entails that 



Pr 



S{G)\ < k~ r M > l-2/3-o(l), 



which contradicts (10). 



A.4 Proof of Theorem 5 

Suppose that d < (1 — e)k \nk, and that k > ko(e) for a sufficiently large ko(e). Let q = 5 and recall that a 
graph is ^-choosable if for any assignments of color lists of length at least £ to the vertices of the graph there 
is a proper coloring such that each vertex receives a color from its list. To prove Theorem 5, we consider the 
property £ that all vertices are loose and the following condition V: 

For any set S C V of size |5| < g(n) the subgraph induced on 5 is (q — l)-choosable. 

Here q > is a constant and g(n) = \/nf(n) = o(n), where f(n) is the function from Theorem 7. 

Lemma 11. With high probability the random graph G(n, m) satisfies T>. 



Proof. Since m = 0(n), this follows from a standard first moment argument. 



□ 



By Theorem 7, we just need to establish (7). Thus, let a G [k] n be a coloring such that the color classes 
Vi = cr~ l {i) satisfy \Vi\ ~ n/k, and let G be a random graph with m edges such that a is a /c-coloring of 
G. Let t>o £ V" be any vertex; without loss of generality we may assume that ct(vq) = 1. In addition, let 
1 < I < k be the "target color" for vq. If vq has no neighbor in Vi, then we can just assign this color to vq. 

Otherwise, we run the following process. In the course of the process, every vertex is either awake, 
dead, or asleep. Initially, all the neighbors of vq in Vi are awake, v is dead, and all other vertices are asleep. 
In each step of the process, pick an awake vertex w arbitrarily and declare it dead (if there is no awake 
vertex, terminate the process). If there are at least q colors ci(w), . . . , c q (w) such that w has no neighbor in 
V c .r w \, then we do nothing. Otherwise, we pick q colors ci(w), . . . , c q {w) randomly and declare all asleep 
neighbors of w in V c . ^ awake for 1 < j < q. 

Lemma 12. With probability at least 1 — exp(— /(n)) there are at most g{n) dead vertices when the process 
terminates. 

Proof. We show that the aforementioned process is dominated by a branching process in which the expected 
number of successors is less than one. Then the assertion follows from Chernoff bounds. 

To set up the analogy, note that the expected number of neighbors of any w G V \ Vi in Vi is asymptoti- 
cally 2m/k < j5| • In A;. Hence, the probability that w has no neighbor in Vi is at least k e l 2 ~ x . Therefore, 
the expected number of classes i ^ a(w) in which w has no neighbor is at least (k — l)k e l 2 ~ l > k £ / 3 . 
Furthermore, the number of such classes is asymptotically binomially distributed. Therefore, assuming that 
k is sufficiently large, we conclude that the probability that there are less than q classes in which w has no 
neighbor is less than kT 1 . Given that this is so, the number of neighbors of w in each of the q chosen classes 
ci(w), . . . , c q (w) has mean at most 2 In k. Therefore, the expected number of newly awake vertices resulting 
from w is at most 2/c" 1 In A;. Thus, the above process is dominated by a branching process with successor 
rate 2fe _1 Ink < 1. Therefore, the assertion follows from stochastic domiance and Chernoff bounds. □ 

Proof of Theorem 5. Let S be the set of dead vertices left by the aforementioned process. By Lemma 12 we 
may assume that \ W\ < g{n). Hence, conditioning on V, we may assume that S is (q — l)-choosable. Now, 
we assign lists of colors to the vertices in S as follows. The list of vo just consists of its target color /. To 
any other w G W we assign the list L w = {ci(w), . . . , c q (w)} \ {I}. Now, we can color the subgraph G [S] 
by assigning color / to v and a color from L w to any other w G W. We extend this to a coloring of G by 
assigning color a(u) to any u G V \ W. Let r signify the resulting coloring. 

We claim that r is a proper coloring of G. For both the subgraph induced on W and the subgraph induced 
on V \ W are properly colored. Moreover, by construction no w G W \ {v} is adjacent to a vertex of color 
Cj(w) in V\W. Finally, a and r are at Hamming distance at most \ W\ < g(n) = o{n). Hence, the assertion 
follows from Theorem 7. □ 

A.5 Rigid variables 

Let a,e > 0, and assume that k > ko(s) for a large enough ko(e, a) > 0. Suppose that (1 + e)klnk < 
d = 2m /n < (2 — e) k In k. To prove Theorem 4 for coloring, we apply Theorem 7 as follows. We let (3 = 
(3(e, a) > be a sufficiently small number and denote by £ the following property of a pair (G, a) G A n ^ m . 

There is a subgraph G* C G of size > (1 — a)n such that for every vertex v of and 

each color i ^ a(v) there are at least (3 In k vertices vertex w in G* that are adjacent to v such that (11) 

a(w) = i. 

Also, we let V be the property that the maximum degree is at most In 2 n. 



Lemma 13. Condition (7) is satisfied with D and £ as above. 



Proof of Theorem 4 for coloring. Given a random coloring a of a random graph G = G(n, m), Lemma 13 
and Theorem 7 imply that w.h.p. there is a subgraph G* satisfying (11). In addition, we assume that G has 
the following property. 

There is no set S C V of size |5| < n/(k Ink) that spans at least \S\ ^ In k edges. (12) 

A standard 1st moment argument shows that (12) holds in G(n, m) w.h.p. 

Assume for contradiction that there is another coloring r such that the set U = {v G G* : a(v) / t(v)} 
has size \U\ < n/(kln k). Let Uf = {» £ G» : t(v) = i / a(v)} and C/T = {« £ G» : a(v) = i / r(v)}. 
Then 

fc k 

i^i = X>ri = X>ri- («) 

i=l i=l 

Every u £ G» \ 7j has at least /3 mA; neighbors in G* n cj _1 (i). Hence, if u G Xlf , then all of these 
neighbors lie inside of U7~ . We claim that this implies that | \ < | U[~ \ . For assume that Xjf > U~ and 
set S = U TJ-. Then |5| < \U\ < n/(khik), and S spans at least |5|f In A; ed ges, in contradiction 
to (12). Thus, we conclude that \U^\ < \U~\ for all i, in contradiction to (13). Hence, all the vertices in G* 
are 17 (n) -rigid. □ 



A.6 Proof of Lemma 13 

Let(G,a) G be a random pair chosen from the distribution "P nm . We may assume that | cr~ 1 («) | ^n/k 
for all i and let Vi = a" 1 (i). To simplify the analysis, we shall replace the random graph G, which has afixed 
number m of edges, by a random graph G' in which is obtained by including each edge {v, w} with a(v) ^ 
a(w) with probability p independently. Here p is chosen so that the expected number Yli<i<j<k \Vi\'\Vj\'P 
of edges of G' equals m. 

Lemma 14. For any property Q we have Pr [G has Q\T>] < 0(y/n ■ Pr [G' has Q\T>]). 

We defer the proof to Section A.7. 

Thus, in the sequel we will work with G' rather than G. Let 7 = 7(e) > be a sufficiently small 
number, and let Vi = <r _1 (z). Moreover, for a vertex v and a set Z C V let e(v, Z) signify the number of 
w-Z-edges in G'. We construct a subgraph G* of G' as follows. 

1. Let W i:j = {v£V l : e(v, Vj) < -/Ink}, W % = (jj=i Wy, and W = (J- = i W { . 

2. Let = {v G V« : e(«, Wj) > | In A;} and C7 = [J ¥ , ^- 

3. Let Z = U. While there is a vertex u G V \ Z that has at least 10 neighbors in Z, add v to Z. 

4. Let G* = G'- U ti W< - ^. 

For each vertex v £ Vi and each color j ^ i the expected number of neighbors of u with color i is 
|Vi| • — ~ (1 + 2e) In A;. Hence, the sets Wy contain those vertices form Vi that have a lot fewer neighbors 
with color j than expected. 

Lemma 15. There is a number (3 = (3(e) > such that with probability > 1 — exp(— Q(n)) we have 
\Wij\ < nk~ 2 ~^ for any Hence, \W{\ < nk~ l ~P, and \W\ < nk" 13 . 



Proof. In the random graph G' for each v E Vi the number e(v,Vj) is binomially distributed. Hence, the 
probability that e(v, Vj) < 7 In k is at most exp(— (1 + e') Ink), where e' > depends only on e and 
7. Furthermore, as in G' edges occur independently, \Wij\ is binomially distributed as well (with mean 
< £ • exp(— (1 + e') In A;)). Therefore, the assertion follows from Chernoff bounds. □ 

Each of the vertices in U has a lot of neighbors in the small set W. Therefore, since the random graph 
G' is a good expander, we expect U to be much smaller than W . 

Lemma 16. Given that T> occurs, with probability at least 1 — exp(— Q(n)) the set U contains at most 
nk~ 7 vertices. 

We postpone the proof to Section A.8. 

Lemma 17. With probability > 1 — exp( — Q{n)) the set Z contains at most nk~ e vertices. 

Proof. Assume that this is not the case. Let Y contain all vertices of U and the first nk~ 6 — \U\ vertices 
added to Z by step 3 of the construction of G*. Then \Y\ < nk~ G and e(Y) > 9\Y\. However, a simple 
first moment argument shows that the probability that a set Y with these two properties is present in G' is at 
most < exp(— Q(n)). □ 

Combining Lemma 15, 16, and 17, we conclude that G* contains at least n(l — a) vertices (provided 
that k is sufficiently larger). Moreover, the construction of ensures that this graph satisfies (11). 



A.7 Proof of Lemma 14 

Given that G' has exactly m edges, G' is just a uniformly random graph with planted coloring Vi, . . . , V^. 
That is, given that the number of edges is m, G' is identically distributed to G. Therefore, 

r L as I J - Pr [G 1 has V] 

nt u Pr [G has both V and V] 
^ Q{U " 2) - Pr^hasP] • (14) 

Furthermore, since m = 0(n), with probability 1 — o(l) the maximum degree of G as well as of G' is at 
most Inn. Therefore, G, G' have T> with probability 1 — o(l). Hence, (14) yields 

Pr [G' has > fl(„-i) • ^ [g has both p and P] = pr [(? ^ 

rr [Gr has i^J 

as claimed. 

A.8 Proof of Lemma 16 

To analyze the sets Uu from the second step of the construction of G*, consider 

U' a = {v eVi : e(v,Wi \ W a ) > ^mfc}, 
^ = {«e^:e(t7,Wi,) > Jlnfc}. 



Lemma 18. With probability > 1 — exp(— fi(n)) we have \U' a \ < nk 



-10 



Proof. The definition of the set Wi \ Wu depends solely on the Vi-V \ VJ-edges. Therefore, the VJ-V^-edges 
are indepenent of the random set Wi \ Wu, which with probability > 1 — exp(— Q(n)) has size < nk~ l ~P 
by the Lemma 15. Assuming that this is indeed the case, we conclude that for any vertex v G V\ the number 
e(v, W{ \ Wu) is binomially distributed with mean npk~ l ~@ < 2k~@ In k. Hence, the probability that v has 
z = jink neighbors inside Wi \ Wu is at most 



i 

'nk~ l ~P 



z 



Thus, E|{7^| < nk 10 /2. Finally, as \U' U \ is binomially distributed, the assertion follows from Chernoff 
bounds. □ 



Lemma 19. Conditional on the event D, with probability > 1 — exp(— Q(n)) we have \U^\ < nk 



-10 



Proof. We just need to analyze the bipartite subgraph G' [Vi U Vr}. The set Wu consists of all vertices v G Vi 
that have degree < 7 In A; in this subgraph. To investigate G' [Vi U Vr}, we condition on the degree sequence 
d of this bipartite graph. Since we also condition on the event V, the maximum degree is < In 2 n. Hence, 
we can generate the random bipartite graph with degree sequence d via the configuration model, and the 
probability that the resulting multigraph happens to be a simple graph is > exp(— 0(ln 4 n)). Thus, we just 
need to study a random configuration. 

Now, in a random configuration the probability that a vertex v G Vi has ^ In k neighbors in the set Wu 
is < k~ 10 , because the total number of edges of G' [Vi U Vr] is concentrated about n 2 pk~ 2 . Therefore, the 
(conditional) expected size of is < nk . Consequently, Azuma's inequality yields that with probability 
> 1 - exp(-J2(re)) the size of is < nk~ w . □ 

Finally, Lemma 16 follows immedately from the fact that Uu C U' a U U£. 



A.9 Proof of Theorem 1 

To prove the coloring part of Theorem 1 , we need to come up with an appropriate way to measure how 
"similar" two £>colorings of a given graph are G = G(n, to). A first idea may be to just use the Hamming 
distance. However, if we construct a coloring r simply by permuting the color classes of another coloring 
a, then a and r can have Hamming distance n, although they are essentially identical. Therefore, instead 
of the Hamming distance we shall use the following concept. Given two coloring a, r, we let M a jT = 
(M^ T )i<ij<fc be the matrix with entries 

M% T = n^(i)^T-\j)[. 

Then to measure how close r is to a we let 

Ur) = \\M a , T \\l=^2(M^ T f 

be the squared Frobenius norm of M CTjT . Hence, f a is a map from the set [k] n of /c-colorings to the interval 
[k~ 2 , / ct ((t)], where f a {a) > fc -1 . (Thus, the larger /o-(r), the more r resembles a.) Furthermore, for a 
fixed a G S(G) and a number A > we let 



9a,G,x(x) = \{t G [k] n : U(t) =xA H(t) < Xn}\. 



In order to show that S{G n ^ m ) with m = rn decomposes into exponentially many regions, we employ 
the following lemma. 

Lemma 20. Suppose that r > + £k)k In k. There are numbers k~ 2 < y\ < y2 < fc~ 1 ond A, 7 > such 
that with high probability a pair (G, a) G A n ^ m chosen from the distributoin lA n)m has the following two 
properties. 

1. For all x G [2/1,2/2] we have g a ,G,\(x) = 0. 

2. The number of colorings r G S(G) such that /o-(t) > 1/2 is at most exp(-jn) ■ \S(G)\. 

Let G = G n m be a random graph and call a G S(G) good if 1. and 2. hold. Then Lemma 20 states that 
with high probability a 1 — o(l)-fraction of all a G S(G) is good. Hence, to decompose S{G) into regions, 
we proceed as follows. For each a G S(G) we let 

C a = {r G 5(G) : / CT (r) > y 2 }. 

Then starting with the set S = S(G) and removing iteratively some C a for a good a G S from 5 yields an 
exponential number of regions. Furthermore, each such region C a is separated by a linear Hamming distance 
from the set S(G) \ C a , because f a is continuous with respect to ?i~ 1 xHamming distance (that is, for any 
e > there is 5 > such that /,j(t) < e for all r G [k] n satisfying dist(cr, r) < 5n). Thus, the property 
stated in Theorem 1 follows from Lemma 20. 

To establish Lemma 20, we employ the planted model. 

Lemma 21. Suppose that r > + Sk)klnk. There are k~ 2 < yi < 1/2 < k" 1 and A, 7 > such that 
with probability at least 1 — exp(— i?(n)) a pair (G, a) G A n ^ m chosen from the distributoin V n ^ m the two 
properties stated in Lemma 20. 

Thus, Lemma 20 follows from Lemma 21 and Theorem 7. 

Proof of Lemma 21. The proof is based on the first moment method. Let a G [A:] 71 be a fixed assignment 
of colors to the vertices. We may assume that |cr _1 (i)| ~ n/k for all 1 < i < k, because all but an 
exponentially small fraction of all assignments in [k] n have this property. Further, let G be a graph with m 
edges such that a is a /c-coloring of G chosen uniformly at random from the set of all such graphs. A direct 
computation shows that for an assignment r G [k] n the probability that H(t) < An is 

^ ( 1 ~ f^T^ ) rn exp((V>(A) + (l))n), (15) 

where lim^o Y^A) = 0. To prove the lemma, we shall compute the expected number of assignments r such 
that H(t) < An and /o-(t) = x for a suitable y\ < x < y2- 

To this end, we have to take into account the number of possible colorings r. We parameterize the set 
of all possible r by a matrix A = (aij)i<ij<fc, where a>ij = n~ 1 |o"~ 1 (i) n r _1 (j)|. Then by (15) the 
contribution of a matrix A to the first moment is at most 

HA) = AT" (, ; ) f 1 " 2 *" 1 ^/^ ) ^ exp((V(A) + o(l))n) 

(the £;~ n accounts for the fact that we consider the coloring a fixed). Taking logarithms, we obtain 

k (\ - 2k' 1 + y k a 2 \ 
n~ l \nF{A) ~ - In k - Oij M<Hj) + r ln 1 _ k pr^ ~ J + ^( A )- 



For a given number x we let A(x) be the set of all matrices A = {a-ij)i<i,j<k such that ajj > 0, X^f=i a y ~ 
k~ l , and X^'=i = x - Since there are at most n k ' z possible matrices A, for any given x the expected 
number of colorings r such that /<t(t) = x is at most 

n fc2 max F(A). 

Hence, by continuity it suffices to show that for some x £ [yi, 7/2] the expression ji" 1 max^g^^) ^^(A) 
is strictly negative for a small enough A > 0. 

Let h = k~ 3 / 2 and x = k~ 1 —2h. Then Theorem 9 from [4] shows that the maximum max^g^^^ In !F{A) 
is attained for a matrix A with entries 

an = - h + o(h), mj ~ h(k - l)" 1 (i ^ j) 

asymptotically as k grows. An explicit computation shows that for this matrix A the value ln^^l) is strictly 
negative, provided that A is sufficiently small. Therefore, we can apply Markov's inequality to complete the 
proof. □ 

B Proofs for Random fc-SAT 
B.l The planted model 

Consider the distribution U nfln on the set A n ^ m of pairs (F, a), where F is a fc-SAT formula with variables 
X\, . . . , x n and with m clauses, and a is a satisfying assignment of F. 

Ul. Generate a random formula F = Fk(n, m). 

U2. Sample a satisfying assignment a of F uniformly at random; if F is unsatisfiable, fail. 
U3. Output the pair (F, a). 

To analyze this distribution, we consider the distribution V n ,m on A n ^ m induced by following experiment. 
PI. Generate a random assignment a G {0, l} n . 

P2. Generate a random fc-CNF formula F with m clauses chosen uniformly among those satisfied by a. 
P3. Output the pair (F, a). 

Our goal is to establish the following connection between these two distributions. 

Theorem 8. There is a sequence £k — > such that the following holds. Let m = \rn\ for some r < 
(1 — £fc)2 fc In 2, and let f(n) be any function that such that linij^oo f(n) = 00. Let T> be any property such 
that Fk(n, m) has T> with probability 1 — o(l), and let £ be any property of pairs (F, a) G A n ^ m . If for all 
sufficiently large n we have 

P*v n , m [(F, a) has £\F has V] > 1 - exp(-k2 3 ~ k n - f(n)), (16) 

then Vr Un m [(F, a) has £} = 1 - o(l). 

The proof of Theorem 8 is based on the following lemma, which we establish in Section B.2. 

Lemma 22. Let \i = 2 n (l — 2~ k ) m denote the expected number of satisfying assignments of a random 
k-CNF Ffc(n, m). Then for k > 8, w.h.p. 

\S(F k (n,m)\ > nexp(-k2 3 - k n) . 



Proof (Theorem 8). Assume for contradiction that there is a fixed a > such that Pr^ n m [(F, a) has £] > a 
for infinitely many n. Then Lemma 22 implies that the set L = A n ^ m \ £ has size 



\L\ > ^/xexp(-A:2 3 - fc n) 



in 



|exp(-A:2 3 - fc n)|yl n , m |. (17) 



On the other hand, as V nm is just the uniform distribution on the set A riirn , (16) implies that 

\L\ < exp(-A;2 3 - fc n - f(n))\A n , m \. 
As f(n) — > oo, this contradicts (17) for sufficiently large n. □ 

B.2 Proof of Lemma 22 

Let Ab be the function defined by 



A b (l/2,k,r)=4 

where e satisfies 



((l-e/2) fc -2- fc )' 



(18) 



e(2-e) fc_1 = 1 . (19) 

Lemma 23. Suppose that r < 2 k In 2 — k. Then F^n, rn) has at least (Ab(l/2, k, r) — o(l)) n / 2 satisfying 
assignments w.h.p. 

Recall that Fk(n, m) denotes a random A;-SAT formula on n variables xi, . . . , x n . For a fixed number 
B > 1 we let Ab denote the property that a fc-SAT formula F on the variables xi,...,x n has less than \B n 
satisfying assignments. The following lemma shows that Ab has a shaip threshold. 

Lemma 24. For any B > 1 there is a sequence (T^) n >i of integers such that for any e > we have 

lim Pr(Ffc(n, (1 — e)T r f ) /ias property Ab) = 0, ant/ 

n— >oo 

lim Pr(Ffc(n, (1 + e)T^) has property Ab) = 1- 

n— >oo 

Assuming Lemma 24, we can infer Lemma 23 easily. 

Proof (Lemma 23.). Let r < 2 k In 2 — k. Equations (18) and (19) show that p yt 6 (l/2, k, p) is a con- 
tinuous function. Therefore, for any e > there is a < 5 < 2 fc ln2 - k - r such that r' = (1 + S) 2 r 
satisfies 

A(l/2,fc,r') > 4,(1/2, A;, r) - e. 



Let -B = ^Af,(l/2, k, r'). Setting t = \B n , the second moment argument from [6] shows in combination 
with the Paley-Zigmund inequality that 

lim infj^oo Pr[Ffc(n, r'n) does not satisfy Ab] > 0. 

Therefore, Lemma 24 implies that r'n < (1 + 5)T^ for sufficiently large n. Consequently, for large n we 
have rn = (1 + 5)~ 2 r'n < (1 + 5)~ 1 Tlf . Hence, Lemma 24 yields 

lim Pr[Ffc(n, rn) does not satisfy Ab] = L 



Thus, with probability 1 — o(l) the number Z of satisfying assignments of Fk(n, rn) satisfies 



Z > -B n = -A b (l/2, k, r'T /2 > 2 (A(l/2, k, r) - e) n l\ 



Since this is true for any e > 0, the assertion follows. 



□ 



Proof (Lemma 22.). As shown in [6], the solution e to (19) satisfies 



2 l ~ k + U~ k < e < 2 1_fe + 3A;4 



-k 



(20) 



Plugging these bounds into (18) and performing a tedious but straightforward computation, we obtain that 



To prove Lemma 24, we need a bit of notation. If is a formula on a set of variables yi,...,yi disjoint 
from xi, . . . , x n , then we let E n {4>) denote the set of all formulas that can be obtained from <ft by substituting 
/ distinct variables among x\, . . . , x n for yi, . . . , y\. Moreover, for a formula F on x\ , . . . , x n we let F(B<fi = 
F A <p*, where <ft* is chosen uniformly at random from E n {4>). 

Note that Ab is a monotone property, i.e., if F has the property Ab and F' is another formula on 
the variables x±, . . . , x n , then F A F 1 has the property Ab as well. Therefore, we can use the following 
theorem from Friedgut [15] to prove by contradiction that Ab has a sharp threshold. Let oj(n) = [logn] 
for concreteness. 

Theorem 9. Suppose that A b does not have a sharp threshold. Then there exist a number a > 0, a formula 
(j), and for any uq > numbers N > no, M > and a formula F with variables x±, . . . , xn such that the 
following is true. 

Tl. Pr(F © 4> has the property Ab) > I — a. 

T2. a < Pr(F k (N, M) has the property A B ) < 1 - 3a. 

T3. With probability at least a a random formula F^ (N, M) contains an element of En (<j>) as a subformula. 
T4. Pr(F A F k (N, 2u(N))) has the property A B ) < 1 - 2a. 

In the sequel we assume the existence of a, (p, N, M, and F satisfying conditions T1-T3. To conclude 
that A b has a sharp threshold, we shall show that then condition T4 cannot hold. Clearly, we may assume 
that N is sufficiently large (by choosing ?iq appropriately). Let V = {x±, . . . , xn}- 

Lemma 25. The formula (j) is satisfiable. 

Proof. Any /c-SAT formula that contains at most as many clauses as variables is satisfiable. Hence, to es- 
tablish the lemma, we will show that the probability Q that F^(JV, M) contains a subformula on I variables 
with at least / clauses is smaller than a; then the assertion follows from T3. 

To prove this statement, we employ the union bound. There are ( , ) ways to choose a set of I variables, 
and (Y) ways to choose slots for the I clauses of the subformula. Furthermore, the probability that the 
random clauses in these / slots contain only the chosen variables is at most (l/N) kl . Hence, the probability 
that Fk (N, M) has / variables that span a subformula with at least I clauses is at most 



-]nA b (l/2,k,r) > ln2 + r ln(l - 2~ k ) - k2 



,3-2fc 



Since r < 2 k , the assertion thus follows from Lemma 23. 



□ 




(21) 



Further, T2 implies that M/N < 2 k , because for M/N > 2 k the expected number of satisfying assignments 
of Fk(N,M) is less than 1. Thus, assuming that N is sufficiently large, we see that (21) implies Q < 
(e 2 {2l) k /N) 1 < a, as claimed. □ 

Thus, fix a satisfying assignment a : {y\, . . . , y{\ — > {0, 1} of <fi. Then we say that a satisfying assign- 
ment x of F is compatible with a tuple (z\, . . . , z{) G V 1 if x( z i) = f° r all 1 < i < /. Furthermore, 
we call a tuple (z\, . . . ,zi) G V 1 bad if F has less than ^B N satisfying assignments x that are compatible 
withOi,...,^). 

Lemma 26. There are at least (1 — a)N l bad tuples. 

Proof. The formula F © <fi is obtained by substituting I randomly chosen variables (z±, . . . ,zi) G V 1 for the 
variables yi, . . . , y\ of and adding the resulting clauses to i 7 . Since by Tl with probability at least 1 — a 
the resulting formula has at most \B N satisfying assignments, a uniformly chosen tuple (zi, . . . ,z{) G V 
is bad with probability at least I — a. Thus, there are at least (1 — a)N l bad tuples. □ 

Lemma 27. With probability at least 1 — a a random formula Fk(N, u>(N)) contains I clauses C\,...,C\ 
with the following two properties. 

Bl. For each 1 < i < I there is a k-tuple of variables (vj, . . . , v k ) G V k such that Ci = v\ V • • • V vf if 

a(i) = 1, and Ci = ^vj V • • • V ->vf ifa(i) = 0. 

f(D f(l) 

B2. For any function f : [I] — > [k] the l-tuple {v\ , . . . , vi w ) is bad. 

The proof of Lemma 27 is based on the following version of the Erdos-Simonovits theorem(cf. [15, Propo- 
sition 3.5]). 

Theorem 10. For any 7 > there are numbers > such that for any v > vq and any set He [u] 1 

of size \H\ > 7^* the following is true. If I k-tuples (w\, . . . ,w k ), . . . , (wj, ... ,wf) G [v] k are chosen 
uniformly at random and independently, then with probability at least 7' for any function /:[/]—» [k] the 

tuple (w\ , . . . , w\ 1 ) belongs to H. 

Proof (Proof of Lemma 27). Assuming that N is sufficiently large, we apply Theorem 10 to 7 = 1 — a, 
v = N, and the set H C [N] 1 of bad /-tuples. Then by Lemma 26 we have \H\ > jN l . Now, consider I 
random fc-clauses C\ , . . . , C\ over the variable set V chosen uniformly and independently. Let V\ , . . . , V; be 
the /c-tuples of variables underlying C±, . . . , C\. Then Theorem 10 entails that V\,...,Vi satisfy condition 
B2 with probability at least 7'. Moreover, given that this is the case, condition Bl is satisfied with probability 
2~ kl . Therefore, the clauses C±,...,Ci satisfy both Bl and B2 with probability at least r y'2~ kl . Hence, 
the probability that F^(N ,uj(N)) does not feature an /-tuple of clauses satisfying Bl and B2 is at most 
(1 — -j'2~ kl ) L^( Ar )/'J . Since u(N) = [log N] , we can ensure that this expression is less than a by choosing 
N large enough. □ 

Corollary 3. With probability at least 1 — a the formula F A Fj t (N, u>(N)) has at most \k l ■ B N satisfying 
assignments. 

Proof. We will show that if C\, . . . , C[ are clauses satisfying the two conditions from Lemma 27, then 
F A C\ A • • • A Ci has at most ^k l B N satisfying assignments. Then the assertion follows from Lemma 27. 

Thus, let x be a satisfying assignment of F A C\ A • • • A G\. Then by the Bl for each 1 < i < I 
there is an index f x (i) G [k] such that x{ v { x ^) = Moreover, by B2 the tuple (v[* , . . . , vf x ^) 



is bad. Hence, the map \ l— > fx e [^f yields a bad tuple {v\ )i<i<l for each satisfying assignment. 
Therefore, the number of satisfying assignments mapped to any tuple in [k] 1 is at most \B n . Consequently, 
F A C\ A • • • A C\ has at most \k ■ B n satisfying assignments in total. □ 

Corollary 4. With probability at least 1 — |a the formula F A Fk(N, 2ui) satisfies As- 
Proof The formula F** = F A Fk(N, 2uj) is obtained from F by attaching 2uo{N) random clauses. Let 
F* = F A Fk(N,Lj(N)) be the formula resulting by attaching the first u(N) random clauses. Then by 
Corollary 3 with probability at least 1 — a the formula F* has at most \k l ■ B N satisfyng assignments. 
Conditioning on this event, we form F** by attaching another oj(N) random clauses to F*. Since for any 
satisfying assignment of F* the probability that these additional lo(N) clauses are satisfied as well is (1 — 
2~ k ) w ( N \ the expected number of satisfying assignments of F** is at most 

X -k l -B N -(1- 2 ~ k r^ <j B N , 
provided that N is sufficiently large. Therefore, Markov's inequality entails that 

Pr(F** violates Ab\F* has at most \k l ■ B N satisfying assignments) < a/2. 
Thus, we obtain 

Pr(F** violates Ab) < Pr(F* has more than ^k l ■ B N satisfying assignments) 

+ Pr(F** violates Ab\F* has at most ^k l • B N satisfying assignments) < 3a/2, 

as desired. □ 

Combining Theorem 9 and Corollary 4, we conclude that A b has a sharp threshold, thereby completing the 
proof of Lemma 24. 

B.3 Proof of Theorem 2 

Using Theorem 8, we shall establish the following lemma. 

Lemma 28. There exist numbers < ol\ < «2 < 3. A > 0, and 7 > such that a random pair (F, a) 
chosen from the distribution U n ^ m has the following two properties w.h.p. 

1. Any assignment r such that a± < n~ 1 dist(a, r) < 02 satisfies H(r) > An. 

2. \{t e S{F) : dist(a,r) < (3 2 n}\ < 2 n (l - 2- fe ) m exp(- 7 n - k2 3 ~ k n). 

Proof (Theorem 2). Let F = Fk(n,m) be a random /e-SAT instance. To each assignment a G 5(F) we 
assign the set 

C a = {t G S(F) : dist(a,r) < a 2 n}. 
Due to Lemma 22, a similar argument as in the proof of Theorem 1 in Section A.9 yields Theorem 2. □ 

Let A > be small but fixed. Let F = Fk(n, m) be a random /c-SAT formula with m = rn clauses. 
Then for any a G {0, l} n we have 

n _1 InPr [a is satisfying] = r ln(l — 2~ k ), 



because of the independence of all m clauses. Furthermore, if r € {0, l} n is a second assignment at Ham- 
ming distance an from a, then 

rT l InPr [a, r are both satisfying] = r ln(l - 2 1 ~ k + 2~ k (l - a) k ). 
Indeed, there is a function <^(A) such that lim^o !?(A) = and 

n _1 lnPr[iI(o-) =0 A-ff(r) < An] = r [ln(l - 2 1 ~ fc + 2~ fc (l - a) fc ) + <?(A) . 

Let X q a signify the number of assignments r at Hamming distance an from a such that i?(r) < An. 

Lemma 29. There is a number < a* < 1/3 such that for sufficiently small A > we have 

n _1 lnE [X a * } x\a is satisfying] < -k2 3 ~ k . 

Proof. There are f ") ways to choose an assignment r at Hamming distance an from cr. Therefore, due to 
the formulas derived above, we have 



n 1 In E [X at a | o is satisfying] = — a In a— (1— a) m(l— a)+r 



I- 1 



Setting a* = (k In k) 1 and simplifying, we obtain the assertion. □ 

Corollary 5. There are numbers A > and < a\ < o?2 < 3 such that with probability at least 1 — 
o(exp(— &2 3 ~ fc n) in a /?az'r (F, a) E vl n m chosen from the distribution V n ^ m there is no assignment r such 
that such that H(t) < An and a\ < n~ 1 dist(a, r) < 02- 

Proof. If (F, a) is chosen from V n ,m, then F is distributed as a random formula F^(n, m) given that a is a 
satisfying assignment. Therefore, the corollary follows from Lemma 29 and Markov's inequality, where we 
use the fact that a 1— > n _1 InE [X a t \\a is satisfying] is a continuos function. □ 

Furthermore, the following estimate has been established in [7]. 

Lemma 30. We have max 0<Q , < i n _1 InE [X a ^\\a is satisfying] < ln2 + r(l — 2~ k ) — 2exp(fc2 3 ~ fc ). 

Finally, Lemma 28 follows from Theorem 8 in combination with Corollary 5 and Lemma 30. 

B.4 Proof of Theorem 4 (fc-SAT) 

If F is a fc-SAT formula and a an assignment, then we say that a variable x supports a clause C if changing 
the value of x would render C unsatisfied. Suppose that k is sufficiently large and (1 + e)2 k k~ l In k < r = 
m/n < (1 — e)2 k In 2. Let 7, 8 > be sufficiently small numbers. 

Lemma 31. A pair (F, a) chosen from li n ^ m has the following property w.h.p. 

There is a set U of at least (1 — 5)n variables such that each variable in U supports 7 In k clauses 
e that contain no variable from V \U. 



Proof (Theorem 4). Let £ > signify a sufficiently small constant. Let (F, a) be chosen from the distribu- 
tion lA n , m . We may assume that the random pair (F, a) satisfies (22). Moreover, a 1st moment computation 
shows that the random formula F has the following property w.h.p. 

There is no set Z of variables of size \Z\ < (n such that F features at least |Z|7ln/c clauses e 
that contain at least two variables from Z. 

Now, assume for contradiction that there is a satisfying assignment r of F such that the set Z = {v G U : 
t(v) / cr(v)} has size 1 < \Z\ < (n. Each v G Z supports in a at least 7 mA; clauses e that contain no 
variable from V\U. Since these clauses e are satisfied in t, although t(z) = 0, each such e contains another 
variable w 7^ v from Z. Hence, F contains at least \Z[y In k clauses e containing at least two variables from 
Z, in contradiction to (23). □ 

Lemma 3 1 is an immediate consequence of Theorem 8 and the following lemma. 

Lemma 32. A pair (F, a) chosen from V n:m has the property (22) with probability 1 — o(exp(— k2 3 ~ k n)). 

Proof. We may assume that r = m/n = (1 + e)2 k k^ 1 \nk for a fixed e > 0. Moreover, without loss of 
generality, we may assume that the assignment a sets all variables V = {x\, . . . , x n } to true. Let F denote 
a random formlua with m clauses satisfied by a, and let S signify the set of all uniquely satisfied clauses of 
F. Consider the following process. 

1 . Let Zq be the set of all variables x that support fewer than 27 In k clauses. 

2. Let Z = Zq. While there is a variable x G V \ Z that supports at least 7 In k clauses from S that contain 
a variable from Z, add x to Z. 

The expected number of uniquely satisfied clauses is at least k2~ k m > (1 + e)nlnfc. Hence, each 
variable is expected to support at least (1 + e) Ink clauses. Therefore, if 7 > is sufficiently small, then 
there is a contant (3 > such that the probability that a variable x supports fewer than 27 In k clauses 
is at most k" 1 " 13 . Hence, by Chernoff bounds we have \Zq\ < nk~ x ~PI 2 with probability at least 1 — 

o(exp(— k2 3 ~ h n)). 

Thus, assume that \Zq\ < nk~ l ~^/ 2 . We claim that then the final set Z resulting from Step 2 has 
size at most \Z\ < 2n£;~ 1 ~ /3 / 2 . For assume that \Z\ > 2nk~ 1 ~^l 2 . Then Step 2 removed at least \Z\j2 
variables, whence there are at least 7ln/c|Z|/2 clauses e G S that contain two variables from Z. However, 
a standard 1st moment argument shows that the probability that there exists a set Z with this property is 
o(exp(— k2~ k n)). Hence, with probability at least 1 — o(exp(— k2~ k n) we have \Z\ < 2nk~ 1 - 13 / 2 . Setting 
U = V\ Z concludes the proof. □ 



